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Jet substructure has emerged as a critical tool for LHC searches, but studies so far have relied 
heavily on shower Monte Carlo simulations, which formally approximate QCD at leading-log level. 
We demonstrate that systematic higher-order QCD computations of jet substructure can be car- 
ried out by boosting global event shapes by a large momentum Q, and accounting for effects due 
to finite jet size, initial-state radiation (ISR), and the underlying event (UE) as 1/Q corrections. 
In particular, we compute the 2-subjettiness substructure distribution for boosted Z — ► qq events 
at the LHC at next-to-next-to-next-to-leading-log order. The calculation is greatly simplified by 
recycling the known results for the thrust distribution in e + e~ collisions. The 2-subjettiness dis- 
tribution quickly saturates, becoming Q independent for Q > 400 GeV. Crucially, the effects of jet 
contamination from ISR/UE can be subtracted out analytically at large Q, without knowing their 
detailed form. Amusingly, the Q = oo and Q = distributions are related by a scaling by e, up to 
next-to-leading-log order. 



The Large Hadron Collider (LHC) is exploring a new 
regime where the collision energy far exceeds the masses 
of known standard model particles. At such energies, 
heavy particles such as W/Z bosons and top quarks are 
often produced with large Lorentz boost factors, which 
leaves their hadronic decay products collimated into a 
single energetic "fat jet". Jet substructure techniques 
extract information from these fat jets to distinguish 
boosted heavy objects from the QCD background of jets 
initiated by light quarks and gluons. Examples of vari- 
ables defined for this purpose include planar flow [3, Q , 
jet angularities Q , pull [jj , Af-subjettiness 0, [H[ , dipolar- 
ity |(|, and angular correlations 
boosted Higgs bosons 0, tops 
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versus gluon discrimination 11| , along with many beyond 



ilications to 
] and quark 



the standard model applications (see [12J, |13[ for recent 
reviews). Jet substructure measurements are underway 
at the LHC HHH, but to date, studies of the analyz- 
ing power of substructure variables have been limited by 
the use of leading-log shower Monte Carlo simulations. 
If higher-order QCD computations were available, one 
could use them to directly compare to experiments or 
test the accuracy of Monte Carlo simulations. 

In this paper, we develop a framework for performing 
jet substructure computations analytically, in the limit 
where the boosted object of interest has a large momen- 
tum Q. We find a mapping between global e + e~ event 
shapes — which have been calculated to high precision — 
and jet substructure variables in the large Q limit, treat- 
ing finite jet size, initial state radiation (ISR), and un- 
derlying event (UE) as 1/Q corrections. Concretely, we 
consider the jet substructure observable A^-subjettiness 
Tn 0, which is the subjet version of the global event 
shape A'-jettiness [l6j]. The ratio T/v/T/v-i is a robust 
probe for A^-prong decays [l7j . and compares favorably 
to other methods for boosted object identification. 

Here, we focus on 1- and 2-subjcttiness (71 and T2), 



which are relevant for LHC searches involving W/Z and 
Higgs bosons. We compute the distribution for the 
ratio T2/T1 from Z — > qq decays to next-to-next-to- 
next-to-leading-log (N 3 LL) order, using ingredients from 
higher-order calculations of the classic e + e~ thrust event 



shape [l8-24|. From a calculational point of view, the 
use of this ratio is crucial, since it has a finite limit when 
Q — > 00. We will show that our full subjet distribution is 
equal to the global distribution generated by the Z decay 
products, up to 1/Q power-suppressed corrections. The 
dominant hadronization corrections cause a shift which is 
encoded in a single Q-indcpendent parameter. We com 



pare our substructure calculation to Pythia 8.150 [25 1 
tune 4C and also use Pythia to demonstrate that the ef- 
fects from the jet boundary and from external radiation 
(i.e. ISR and UE) are suppressed by 1/Q, only entering 
at the 5% level for Q > 400 GcV. 

We begin by considering a fat jet of size R (clustered 
with anti-fey [26|) m a PP collision event. This jet should 
contain most of the Z decay products as well as some 
ISR/UE contamination. The jet momentum is Pj = 
^2j£jPj> where j runs over the four- vectors p 1 ^ within 
the jet J. The jet boost Q is defined as Q = \Pj\- To 
calculate A^-(sub)jettiness, we must specify a distance 
measure ji, ljl H,l27|, and we use the geometric measure 
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Tll,Ti2,...,njV 



min{ni -pj,n 2 -Pj, ...,n N -pj}. (1) 



Here, n^ 1 = (1, hi) are lightlike axes defined by the over- 
all minimization. The minimum inside the sum partitions 
the jet's constituents into subjet regions Ji, . . . , J/v, de- 
fined by the axes . For the A^-jettiness event shape, J 
is replaced by the entire event. 



For 1-subjettincss, 71 = min ^ 



■Pr 



which can 



also be written as the small component of the fat-jet mo- 
mentum, 71 = P + = n ■ Pj. If the jet contained all the Z 
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FIG. 1: Kinematics of boosted Z decay. 



decay products and nothing else, 71 would depend only 
on the Z boson momentum as: 



Ti = P+ = JQ 2 +m 



Thus, the difference 



At = 71 - 71 



(2) 



(3) 



measures how much the Z is incorrectly reconstructed. 
We will use At to correct for ISR/UE contamination. 

Turning to 2-subjcttiness, we first calculate the ratio 
T2/T1 including only the Z decay products, and then 
discuss how other effects can be systematically included. 
The distribution for the Z decay products is easily de- 
termined by boosting the Z rest frame distribution. At 
leading order, Z decays to a qq pair which go off back-to- 
back in the rest frame, at an angle 9 (the helicity angle) 
with respect to the boost axis as in Fig.[TJ For simplicity, 
we treat the Z as unpolarizcd with a fiat 9 distribution, 
but one could easily integrate over a different 9 distri- 
bution, for example for Ws coming from top decays Q. 
In the boosted frame, the Z momentum P% and the two 
daughter-quark momenta q± and q% are 

Pg = [e q ,0,0,q}, (4) 

- j-Eg — Q cos 9, —mz sin 9, 0, Q — cos 9Eq j , 

-|.Eq + Q cos 6*, m z sin#, 0, Q + cos&Eq j, 



Qi 



<h 



with Eq = m 2 z + Q 2 . The quark energies are E\ = 
\{E Q -Qcos9) and E 2 = \{E Q + Qcos9). 

For the relevant small T2 region, the subjet directions 
from the minimization in ([!} can be aligned with the 
leading-order quark directions Thus, we can take 



n" = (1,0,0,1), < = i<# ; 



^r<£, (5) 



where is the 71 axis and and n% are the T 2 axes. 
In terms of the subjet masses wii and energies Ei, 



T 2 = Pf + Pi 



2Ei 2E 2 ' 



(6) 



In the large Q limit, E 1 ~ Q sin 2 (6>/2), E 2 ~ Qcos 2 (6>/2), 
and 71 ~ m|/(2Q), while mi are Q independent. Thus 
the distribution of the ratio T2 /T\ asymptotes to a fixed 
Q-independent result. 

Now let us consider how the scaling with Q is affected 
when T2/T1 is considered in a realistic environment, such 
as at the LHC. A measurement of T2/T1 includes effects 
from having a finite jet boundary and from including ra- 
diation from elsewhere in the event. The jet boundary 
R identifies a Q- independent phase space region about 
the jet axis. As Q —J- 00, the phase space for the Z de- 
cay products to land outside of the cone falls as 1/Q. 
Hence, the jet boundary is at most a 1/Q correction to 
T2 /7~i ■ The same conclusion holds if R is defined with a 
jet algorithm other than anti-fc^. 

Next consider radiation not coming from the Z decay 
(i.e. ISR/UE). Since 7/v depends linearly on pj in ([J), 
both 71 and T 2 will be distorted by (different) shifts due 
to this contaminating radiation. If we require the fat-jet 
mass to be close to mz, then the shifts will scale as Tn, 
giving at most an 0(Q ) distortion of Ti/Ti- To turn 
this into a l/Q distortion, note that the distribution of 
contaminating radiation is smooth over the fat jet, and 
at large Q, 
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where = (0,1,0,0). Comparing n ■ Pj and minjni • 
Pj, ri2-pj}, both 71 and T2 will be shifted in the same way 
up to 1/Q corrections. Hence we can remove the leading 
effect of contamination with At from ([3]) , by defining 
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T 2 - Ar 
71 -At' 



(8) 



T21 has two important properties: first, it is close to T2/T1 
since T21 = T2/T1 if only the exact Z decay products are 
included; second, it is insensitive to jet contamination 
up to 1/Q corrections. It is crucial that the At correc- 
tion be made experimentally on an event-by-event ba- 
sis; if only the 7^/71 distribution is measured, then the 
contamination will not be a 1/Q correction. The sub- 
traction can be improved further by replacing Ar with 
At' = At(1 - \m z /Q) in the numerator of ©; the 
additional factor accounts for the average fractional dif- 
ference between T2 and 71 for uncorrelated soft radiation. 
The above logic is also appropriate for event pileup. 

To compute the T21 spectrum at leading order in 1/Q, 
we calculate 7^/71 assuming only the Z decay products 
are included in the fat jet. We then average over the 
angle 9. Using the correspondence with 2-jettiness, the 
factorization formula for 75/71 is (ljj 

= H I ^ C ^ S ^ [dsids 2 dkidk 2 S(ki, k 2 , {n,}, /i) 



00 dT21 
X J(S1,H)J(S2,H)S( 



ki + k 2 siE 2 + s 2 Ei\ 
r 2i 3 ^ ), (9) 
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2E1E2T1 
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where do is the tree-level cross-section given by the 
Z decay rate. Here H = H{mz,^), J(si,/x), and 
S(ki,k2, {rii}, fj,) are respectively the Z — > qq hard func- 
tion, inclusive jet function, and 2-jettiness soft function. 
H and J are known at 0{a 2 s ) 0, [29|]. For simplicity, we 
consider the narrow width limit, neglecting 0(T z /mz) 
corrections. We also neglect non-singular corrections at 
0(a s ). These contribute less than 5% in the peak of the 
r 2 i distribution and can be included following 

We now show that the 2-jettiness soft function S can be 
related to the hemisphere soft function Shemi — relevant 
for thrust and heavy jet mass — which is known pertur- 
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batively to 0{a 2 8 ) [30j,|3l|. The soft function is 



S(ki,k 2 ,ni ■ n 2 ,n,A) = — Y,x a <*(*i _ n i- p l) 
x S(k 2 - n 2 -P s 2 ) (0|Tjy m \X S ) (X s \ Y*X* 1°) . ( 10 ) 

where the Y's are light-like Wilson lines and P}' 2 are the 
momenta of the subjets J1.2 in the state \X S ). Rotational 
invariancc implies that the sub jet directions only appear 
in the combination n\-n%, and the argument A = Aqcd 
is a reminder of nonperturbative corrections contained in 
S. The hemisphere case corresponds to n\ ■ n 2 — 2, so 
that S hc mi(k L ,k R , /i,A) = S(k L , k R , 2, fj,, A). From (JTJ, 
the partitioning into regions of 2-subjettiness is invariant 
under a common rescaling of the sub jet direction, n\ — > 
/3ni and n% ~> /3n 2 . So f| 10[) satisfies 



S(ki,k2,ni-ri2,n, A) = (3 2 S(f3ki, fik 2 , fi 2 ni -n 2 , fx, A). 



Choosing 
we find 



) 2 sin 2 1 



m z 



(11) 



S(ki,k2,ni ■ n 2 ,/i, A) = (3% S (faki, /3gfc 2 , 2, /1, A) 
= S heml (k 1 ,k2,n/pe,k/p e ) , (12) 

where we have rescaled all dimcnsionful arguments by 
Z?^ 1 and used the fact that S has mass dimension —2. 

When fcj » A//?e, the leading nonperturbative cor- 
rection to Shomi is equivalent to a shift 32|-|34|]. fcj — > 
fcj — $//3e, where $ ~ A is Q-independent. Since T2 
in ([T]) is not identical to thrust for massive hadrons, we 
cannot use the value found in (24|. All the objects in 
(|n]) have known renormalization group equations, so we 
can sum large logarithms of T21 up to N 3 LL (with a Padc 
approximation for the small contribution of the four-loop 
cusp anomalous dimension). Thus for t 2 \ 3> 2A/(7i/3e) 
we have 

1 da fdcosO . . , . 

— -3 — =/i / — ^ — H{m z ,tJ.H)UH{rnz,lJiH,^j) 
Co « T 21 7 2 
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FIG. 2: Results of the N LL analytic calculation for T21 with 
<E> = 0. The distribution saturates for Q > 400 GeV. 



Pythia vs. N 3 LL/NNLL 
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FIG. 3: Comparison of theory prediction (bands) for T21 to 
baseline Pythia (histograms). The heavier (lighter) band is 
N 3 LL (NNLL), with widths given by factor of two variations 
of the hard, jet, and soft scales. Here, $ = 700 MeV. Arrows 
indicate the approximate range of validity of (|13[) . 
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T s(fi 



2* Si S 2 ^ (J, J (J,S\ 

"^-2^r-2^- TiZs '^'^J- (13) 



Here S T is the pcrturbativc thrust soft function, and H, 
J, and S T are fixed-order expansions in a s (/iij), a s (//j), 
and a s (/jLs) respectively. Uh and UZ ar e evolution ker- 
nels which sum a\ hr^i terms. See 23( for details. 
The natural scale choices are 



HH = m z , = [Iq V T 2i, l±s = VQT21. 



(14) 



Here \iq = 71 yl + Q 2 /(2m|) is an average over 9 of 
71 (3$ which appears in the large logarithms. For Q = 
one has /iq — mz, while for Q — > oo one has (iq = 
mz /(2\/2). We perform the si ;2 and z s integrations in 
(fT3"|) analytically and the 9 integral numerically. 

Results for the t 2 \ distribution for various Q are shown 
in Fig. [2] As anticipated, the curves rapidly approach a 
fixed distribution at large Q. 

In Fig. [3] we show a comparison to a "baseline" Pythia 
distribution, where the effects of hadronization are in- 
cluded but the Z width, finite cone size, and ISR/UE 
contamination have been turned off. For this compari- 
son we fix $ = 700 MeV to match the peak of the Q = 
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Pythia distribution, which allows us to compute the dis- 
tribution for all Q ^ 0. In the tail of the distribution, 
there is excellent quantitative agreement. The accuracy 
of Pythia's tail is somewhat artificial since it was tuned 
to closely related e + e~ thrust data at Q = 0. Predic- 
tions in the peak region require additional nonperturba- 
tive corrections, which could be included following 24 1. 

In Fig. [4(a)] we show the effect of a finite R = 1.0 
cone and jet contamination in Pythia, restricting our 
attention to jets whose mass is within a 10 GeV window 
of mz- At large Q, the effect of an R = 1.0 cone is quite 
mild. While ISR/UE give a large distortion to Ti/Tl, this 
is successfully corrected in T21 by the Ar in ([8]). With 
the Ar — > Ar' replacement we do even better. Using Ar' 
for Q = 1000 GeV, the Pythia r 2 i distribution with R = 
1.0/ISR/UE is indistinguishable at the 2% level from the 
baseline distribution shown in Fig. [3] Thus our analytic 
result agrees very well with the full Pythia distribution. 

We use Pythia to verify that the effects we have ne- 
glected in our calculation are indeed 1/Q suppressed. In 
Fig. |4(b)| we plot the Kolmogorov-Smirnov /^-statistic 
between the baseline Pythia distribution and Pythia 
as finite cone and ISR/UE effects are reinstated, as a 
function of Q. The D-statistic measures the maximum 
fractional difference between the cumulant T21 distribu- 
tions. Both finite cone and ISR/UE effects fall off as 1/Q, 
and the corrections are < 5% for Q > 400 GeV. 

In the above calculation, we neglected the finite width 
of the Z boson, which leads to 0(Tz /mz) corrections 
that are independent of Q. As shown in Fig. 4(c) finite 
width has only a small effect on the baseline distribution. 
Including Ar yields a larger effect, since © assumed that 
all deviations from the Z pole were due to jet contami- 



nation and not V z- Nevertheless, we see in Fig. 4(c) that 
Ar' still mitigates the effect of ISR/UE. Though beyond 
the scope of this letter, one can directly calculate r2i with 
finite width effects. 

It is interesting to explore analytically the Q depen- 
dence of our da/d,T2i (dropping cone and ISR/UE effects 
and taking $ = 0) by considering two extreme cases. In 
the Z rest frame Q = 0, da / 1 dri\ is equal to thrust da /dr. 
In the Q — > 00 limit, da / dT 21 depends logarithmically on 
T21 multiplied by various functions of the helicity angle 9. 
Isotropically averaging over 9, these logarithms behave as 

d cos 9 , „/ . 1 9\ , „ r ,„/, „_ 9 \ , . 

(15) 



log 



r sin — = log" — 
2 1 e 



-O(log"- 2 r) 



Thus, up to NLL order, the Q — > 00 distribution is re- 
lated to thrust by scaling by a factor ofe — 2.718..., 



da 



dr-i 



1 da , 
edr" (T 



T2i/e) 



(16) 



This is demonstrated in Fig. |4(d)| 

Our technique of treating the jet boundary and exter- 
nal radiation as 1/Q corrections can be readily general- 
ized to color neutral objects with iV-prong decays, and 
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FIG. 4: a) Effect of finite jet cone and ISR/UE in Pythia. 
The Ar' correction mitigates ISR/UE jet contamination, b) 
Fractional effect of adding finite cone and ISR/UE to the 
Pythia baseline distribution. With the Ar' correction, these 
effects are smaller than 5% for Q > 400 GeV, and scale as 
1/Q as expected, c) Effect of finite Z width, d) e-scaling 
between Q — (thrust) and Q = oo. 
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the known NNLL ingredients for the iV-jcttincss event 
shape [27j are a starting point for the calculation of N- 
subjettiness. It can also be used to compute the dis- 
tribution of individual subjet masses m„ which are di- 
rectly accessible with the iV-jettiness factorization theo- 
rem. Another straightforward generalization would be to 
incorporate massive final state quarks as in H — > bb. To 
treat colored objects like boosted top quarks (or to cal- 
culate the QCD background from light quark and gluon 
jets) requires understanding the effect of final-state ra- 
diation on substructure observables, and we anticipate 
that expanding about the Q — > 00 limit will be fruitful 
in that context as well. 
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